Cascaded Modeling for PIMA Indian Diabetes Data
Abstract
This paper develops cascaded models for classifying the PIMA Indian diabetes database. The k-nearest neighbour method is used to impute the missing data, and the processed data are then classified in two steps. In the first step, the k-means clustering algorithm extracts hidden patterns in the data set; in the second step, a suitable classifier performs the classification. The k-means algorithm combined with an artificial neural network classifier and the k-means algorithm combined with a logistic regression classifier each achieve classification accuracy above 98%.
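The pipeline described in the abstract (k-nearest-neighbour imputation, then k-means clustering, then a classifier) can be sketched roughly as follows. This is a minimal illustration in plain Python, not the authors' implementation. The abstract does not say how the clustering output feeds the classifier, so the sketch assumes the cluster id is appended as an extra input feature. The toy data, the helper names (`knn_impute`, `kmeans`, `train_logistic`, `predict`), and all parameter values are hypothetical.

```python
import math

def knn_impute(rows, k=2):
    """Replace each None with the mean of that feature over the k nearest donor rows."""
    def dist(a, b):
        # Use only the features observed in both rows.
        shared = [(x - y) ** 2 for x, y in zip(a, b) if x is not None and y is not None]
        return math.sqrt(sum(shared) / len(shared)) if shared else math.inf

    filled = []
    for i, row in enumerate(rows):
        new_row = list(row)
        for j, value in enumerate(row):
            if value is None:
                # Donors are the other rows in which feature j was observed.
                donors = [r for m, r in enumerate(rows) if m != i and r[j] is not None]
                nearest = sorted(donors, key=lambda r: dist(row, r))[:k]
                new_row[j] = sum(r[j] for r in nearest) / len(nearest)
        filled.append(new_row)
    return filled

def kmeans(points, k=2, iters=20):
    """Lloyd's algorithm; centroids start at the first k points (a toy choice)."""
    centroids = [list(p) for p in points[:k]]
    assignment = [0] * len(points)
    for _ in range(iters):
        assignment = [min(range(k), key=lambda c: math.dist(p, centroids[c]))
                      for p in points]
        for c in range(k):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return assignment

def train_logistic(X, y, lr=0.5, epochs=300):
    """Logistic regression fitted by stochastic gradient descent; last weight is the bias."""
    w = [0.0] * (len(X[0]) + 1)
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            z = w[-1] + sum(wj * xj for wj, xj in zip(w, xi))
            err = 1.0 / (1.0 + math.exp(-z)) - yi
            for j, xj in enumerate(xi):
                w[j] -= lr * err * xj
            w[-1] -= lr * err
    return w

def predict(w, x):
    """Return class 1 when the linear score is non-negative, else class 0."""
    z = w[-1] + sum(wj * xj for wj, xj in zip(w, x))
    return 1 if z >= 0 else 0

# Hypothetical toy data: two scaled features per patient, None marks a missing value.
rows = [[0.20, 0.30], [0.30, None], [0.25, 0.20],
        [0.80, 0.90], [None, 0.85], [0.90, 0.80]]
labels = [0, 0, 0, 1, 1, 1]

filled = knn_impute(rows, k=2)          # step 0: impute missing values
clusters = kmeans(filled, k=2)          # step 1: cluster the imputed data
# Assumed cascade: the cluster id becomes one more input feature.
cascaded = [x + [float(c)] for x, c in zip(filled, clusters)]
weights = train_logistic(cascaded, labels)   # step 2: fit the classifier
predictions = [predict(weights, x) for x in cascaded]
accuracy = sum(p == t for p, t in zip(predictions, labels)) / len(labels)
print(f"training accuracy on toy data: {accuracy:.2f}")
```

The accuracy printed here is measured on six invented training points and says nothing about the figure reported in the abstract. A realistic evaluation would use the actual PIMA data with a held-out test split, and an artificial neural network could replace the logistic regression step in the same position.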
Similar resources
Determinants of Diabetes Mellitus in the Pima Indian Mothers and Indian Medical Students
Diabetes mellitus is a very common and serious disease in many American Indian tribes, Indians, and many other populations in the world. Several well-known factors, such as parental diabetes, genetic markers, obesity, and diet, are considered the main risk factors for diabetes mellitus, while the precise nature of the gene or genes remains unknown. Objectives: The Pimas, Indians, and many oth...
Predicting Type2 Diabetes Using Data Mining Algorithms
Background and purpose: Today, information systems and databases are widely used, and to achieve higher accuracy and speed in making diagnoses, preventing diseases, and choosing treatments, they should be merged with traditional methods. This study aimed to present an accurate system for the diagnosis of diabetes using data mining and a heuristic method combining neural network and pa...
Cascading K-means Clustering and K-Nearest Neighbor Classifier for Categorization of Diabetic Patients (IJEAT)
Medical data mining is the process of extracting hidden patterns from medical data. This paper presents the development of a hybrid model for classifying the Pima Indian diabetic database (PIDD). The model consists of three stages. In the first stage, K-means clustering is used to identify and eliminate incorrectly classified instances. In the second stage, a genetic algorithm (GA) and Correlation bas...
Diagnosis of Diabetes Using an Intelligent Approach Based on Bi-Level Dimensionality Reduction and Classification Algorithms
Objective: Diabetes is one of the most common metabolic diseases. Earlier diagnosis of diabetes and treatment of hyperglycemia and related metabolic abnormalities are of vital importance. Diagnosis of diabetes via proper interpretation of the diabetes data is an important classification problem. Classification systems help clinicians to predict the risk factors that cause diabetes or pre...
Diabetes Diagnosis by Using Computational Intelligence Algorithms
Diabetes mellitus is a chronic disease and one of the major public health challenges worldwide. Most findings indicate that the best way to overcome diabetes is to prevent its risks before a person becomes diabetic. With this idea, we would like to find a way to estimate diabetes risk. Data mining techniques could be used as an alternative way of discovering knowledge from the pat...